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Abstract 

O - 

^s^j , It is demonstrated that doubling the sampling rate recovers some of the loss in capacity 

incurred on the bandlimited Gaussian channel with a one-bit output quantizer. 

Q-r 

1 Introduction 

m , 

We study the capacity of the continuous-time, bandlimited, additive white Gaussian noise 
(AWGN) channel with one-bit output quantization. Our focus is on the capacity at low transmit 
powers, i.e., on the capacity per unit-cost, which is defined as the slope of the capacity-vs-input- 
power curve at zero. We show that increasing the sampling rate reduces the loss in capacity per 
unit-cost caused by the quantization. 

The capacity of the continuous-time AWGN channel without output quantization was studied 
by Shannon [1]. He showed that if the channel input is bandlimited to W Hz and satisfies the 
average-power constraint P, and if the additive Gaussian noise is of double-sided power spectral 
density Nq/2, then the capacity (in nats per second) is given by (see also [2]) 
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where log(-) denotes the natural logarithm function. This capacity can be achieved by trans- 
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X(t)= Xesinc^Wt-l), (el (2) 
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(where R denotes the set of real numbers), and by sampling the output Y(-) at Nyquist rate 
2W. Here {Xg, £ g Z} (where Z denotes the set of integers) is a sequence of independent 
and identically distributed (IID) Gaussian random variables of zero mean and variance P, and 
t H> sinc(i) denotes the sine- function, i.e., 

r 1, t = 

sinc(t) = < sin(7rf) 



nt 



The above (capacity-achieving) transmission scheme reduces the continuous-time channel to 
a discrete-time AWGN channel with inputs {X e , £ e Z} and outputs {Y(£/(2W)), £&Z}. Yet, 
it is often required that the channel inputs and outputs be not only discrete in time, but also 
take on a discrete value, i.e., take value in a finite set rather than in R. This is, for example, 
the case if the transmitter and receiver use digital signal processing techniques. To ensure that 
the channel inputs are discrete- valued, we can simply restrict ourselves to finite input alphabets. 
This restriction is not critical for small input powers P. Indeed, it is well-known that binary 
inputs achieve the capacity per unit-cost of the AWGN channel [1]. To ensure that the channel 
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outputs are discrete- valued, we have to employ a quantizer (analog-to-digital converter), which 
approximates the continuous-valued output by a finite number of bits. 

The capacity (in nats per channel use) of the discrete-time AWGN channel with binary 
symmetric output quantization — where the quantizer produces 1 for a nonnegative output and 
— 1 for a negative output — is given by 

log 2 - H b (Q{^VJ^)) (3) 

where a 2 denotes the variance of the additive noise, -fffc(-) the binary entropy function, and 
Q(-) the Q-function; see [3, (3.4.18)], [4, p. 107], [5, Thm. 2]. To the best of our knowledge, 
there exists no closed-form expression for the capacity of the discrete-time AWGN channel 
with nonbinary output quantization. However, numerical results are, for example, given in [5]. 
Furthermore, there exist analytical results concerning the capacity per-unit cost. For example, 
it was demonstrated that if a binary symmetric quantizer is employed, then the capacity per 
unit-cost equals [3, (3.4.20)]. It was further demonstrated that for an octal quantizer with 
uniform quantization the capacity per unit-cost is not less than 0.475^2 [3, (3.4.21)]. Thus, at 
low transmit power, employing a binary quantizer causes a loss of a factor of 2/-7T relative to the 
capacity per unit-cost for unquantized decoding [1]. In contrast, by quantizing the output 
with 3 bits, a capacity per unit-cost can be achieved that is close to the capacity per unit-cost 
for unquantized decoding. (Note that the capacity of discrete-time channels is measured in 
nats per channel use, whereas the capacity of continuous-time channels is measured in nats per 
second. Since with a continuous-time signal of bandwidth W Hz we can approximately transmit 
2W samples per second, we have that one nat per channel use corresponds to 2W nats per 
second. By further noting that lowpass filtering and sampling Gaussian noise of double-sided 
power spectral density N /2 yields Gaussian noise-samples of variance WN , it follows that the 
capacity per unit-cost of the continuous-time channel corresponds to 2W times the capacity per 
unit-cost of the discrete-time channel with a 2 replaced by WN .) 

The above results suggest that, in order to reduce the loss in capacity per unit-cost caused 
by the quantization, one needs to increase the quantizer's resolution. However, while this clearly 
holds for the discrete-time channel, this does not necessarily hold for the underlying continuous- 
time channel. Indeed, in contrast to the unquantized channel output, the quantized output 
is not bandlimitcd, and it is therefore prima facie not clear, whether sampling the quantized 
output at Nyquist rate is optimal. One might thus increase the capacity of the continuous-time 
channel by oversampling the quantized output, i.e., by sampling the quantized output at rates 
higher than the Nyquist rate. 

When there is no additive noise, it was shown by Gilbert [6] and by Shamai [7] that over- 
sampling indeed increases the capacity. In this paper, we demonstrate that oversampling also 
increases the capacity when the noise power is large relative to the transmit power. In partic- 
ular, we show that, for binary symmetric output quantization, sampling the quantized output 
at twice the Nyquist rate yields a capacity per unit-cost that is not less than 0.75^, which is 
strictly larger than the capacity per unit-cost f ~ 0.64^ that can be achieved by sampling 
the quantized output at Nyquist rate. 

The rest of this paper is organized as follows. Section 2 describes the channel model. Section 3 
defines channel capacity and capacity per unit-cost and presents the main result. Section 4 
provides the proofs of the main result. Section 5 concludes the paper with a discussion of our 
results. 



2 Channel Model 

We consider the communication channel depicted in Figure 1 whose input x(-) is bandlimited 
to W Hz and satisfies the average-power constraint P. The channel output y(fcT s ) at integer 
multiples of the sampling interval T s > is 

Y(kT s ) = sgn{((x + Z)*LPF w )(fcT s )}, keZ (4) 

where sgn{-} denotes the sign function; (a*b)(i) the convolution between a(-) and &(•) at time t; 
and LPFw(-) is the impulse response of the ideal unit-gain lowpass filter of cutoff frequency W. 
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Figure 1: System model. 



The hard-limiter is a binary symmetric quantizer that produces 1 for a nonnegative input and 
— 1 for a negative input. We assume that {Z(t), t £ M.} is white Gaussian noise of double-sided 
power spectral density No/2. 

Without loss of optimality, we restrict ourselves to signals x(-) of the form 



x(t) 



2W 



t £ 



(5) 



where g(-) is some unit-energy waveform that is bandlimited to W Hz. Indeed, by the Sampling 
Theorem [8, Thm. 8.4.5], any signal x(-) that is bandlimited to WHz can be written as (5) with 



X£ — x 



— ) , £ £ Z and g(t) = \/2Wsinc(2Wi) , t £ 



3 Channel Capacity and Capacity per Unit-Cost 

We define the capacity (in nats per second) as 



Ct 3 (P)= Jim sup— J(AT;Y?; 
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where the supremum is over all unit-energy waveforms g(-) that are bandlimited to W Hz and 
over all joint distributions on (X\, X2, ■ . . , X n ) satisfying — J2k=i ^ [X%] < P. Here lim denotes 
the limit inferior; is used to denote the sequence A m , A m+ i 1 . . . , A n ; and 
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(with [•] and |_ - J denoting the ceiling and the floor function). A more general definition of channel 
capacity for continuous-time channels can be found in [2, Sec. 8.1]. For the above channel (4), 
the capacity Ct b (P) defined by (6) is a lower bound on the capacity defined in [2, Sec. 8.1]. The 
two capacities coincide, for example, for the continuous-time AWGN channel (without output 
quantization) . 

That oversampling can increase the capacity of the above channel has been demonstrated 
in the noiseless case, i.e., when No = 0. In particular, Gilbert [6] showed that, for a Gaussian 
input X(-), sampling the output at twice the Nyquist rate yields an information rate of 2.14W 
bits per second, which is strictly larger than the 2W bits per second that can be achieved by 
sampling the output at Nyquist rate. Shamai [7] further showed, inter alia, that by sampling 
the output at ?7-times the Nyquist rate, rates of 2Wlog(l +rf) nats per second are achievable by 
transmitting a bandlimited process that possesses a single real zero within each Nyquist interval. 
In the absence of noise it is thus possible to trade amplitude resolution versus time resolution. 

In this paper, we focus on the case where the variance of the additive noise is large relative 
to the transmit power. In particular, we study the capacity per unit-cost, defined as 



C Ts(0 )^ImT^ 
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(7) 
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(where lim denotes the limit superior). By the Data Processing Inequality [9, Thm. 2.8.1] it 
follows that quantizing the output does not increase the capacity. This implies that the capacity 
per unit-cost is upper bounded by the capacity per unit cost of the continuous-time AWGN 
channel (without output quantization) 
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2W, it was shown that the 
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For the case where the output is sampled at Nyquist rate 1/T S 
capacity per unit-cost is given by [3, (3.4.20)] 

• ,,2 1 1 
= -— « 0.637— . 
2W 7T N N 

Thus, when we sample the output at Nyquist rate, hard- limiting causes a loss of a factor oil fix. 
This loss can be reduced by sampling the output at twice the Nyquist rate: 

Theorem 1 (Main Result). Sampling the output at rate 4W yields 
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Proof. See Section 4.2. 
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The main ingredients in the proof of Theorem 1 arc expansions of the complementary cu- 
mulative distribution function (CCDF) of bivariate and trivariate Gaussian vectors around the 
orthant probability. 1 We present these expansions in the following two propositions. 

Proposition 2. Let (x,y) (f> ^(x, y) denote the probability density function (PDF) of the 
bivariate, zero-mean, Gaussian vector of covariance matrix 
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for\g\<l. Then, for every A>0,a£l and f3 € R, 
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where 

|A(A,a,/3)| < A 2 r;(A, a ,/3) 

and where r](A,a,f3) — ry(A, \a\, \(3\) is monotonically increasing in (A, \a\, \ 
for every finite A, a, and (3. 



and is bounded 



1 The orthant probability is the probability that all components of a random vector have the same sign. 
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Proof. See Section 4.1.1. □ 

Proposition 3. Let (x,y,z) \—¥ <Po,k{x, y, z) denote the PDF of the trivariate, zero-mean, Gaus- 
sian vector of covariance matrix 

1 012 013 
K = | 012 1 023 
013 023 1 

for 1 012 1 < 1, | £?i3 1 < 1, I023I < 1 satisfying det(K) > (where det(K) denotes the determinant 
of the matrix K). Then, for every A > 0, a e R, (3 e R, and 7 € R, 
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+ A(A,a,/3, 7 ) (12) 

w/iere 

|A(A,a,/3,7)| < A 2 r/(A, a ,/3, 7 ) 

and w/iere 7j(A, a, /3, 7) = n(A, |a|, I7I) is monotonically increasing in (A, |a|, \(3\, I7I) and is 
bounded for every finite A, a, (3, and 7. 

Proof. See Section 4.1.2. □ 

4 Proofs 

The proof of Theorem 1 is based on the expansions of the CCDF that were presented in Propo- 
sitions 2 and 3. We derive these expansions in Section 4.1. The proof of Theorem 1 is given in 
Section 4.2. 

4.1 Complementary Cumulative Distribution Functions 

We shall first evaluate the CCDF for the bivariate case. The result will then be used to solve 
the trivariate case. 

4.1.1 Bivariate Case 

In order to prove Proposition 2, we express the CCDF as 
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/ <(>o,k(x, y)dydx 
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= / (/) ,K(x,y)dydx + / / 0o,kO, y) dydx + / / <Po,k( x , y) dy dx 
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I 1 pO poo POO pO 

= - + — arcsin(g) + / / <Po,k{x, y) dy dx + / / </> ,k(x, y) dy dx (13) 

4 27T J-aAJ-fiA Jo J-BA 

where the last step follows from the expression for the orthant probability of a bivariate, zero- 
mean Gaussian vector with correlation coefficient g [10], see also [11]. We proceed by evaluating 
the integrals on the right-hand side (RHS) of (13). To evaluate the first integral, we express the 
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joint PDF (x, y) i-> (j) Q ^(x,y) as the product of a marginal PDF f(x) and a conditional PDF 
/(j/k), i-e., 

(j>o,K(x,y) = 4>o,i(x)(f> exA - e 2(y), (iGMjel) 

where x i ^ M . CT 2 (x) denotes the PDF of a Gaussian random variable of mean fi and variance 
a 2 . We thus obtain for the first integral on the RHS of (13) 
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where Q(-) denotes the Q-function 
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Expressing x i-> Q(x) as a Taylor series around zero yields [12, (3.54)] 
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and where <$(•) denotes the remainder term of the Taylor series expansion of Q(-), which satisfies 
[13, Sec. 0.317] 

l6{x)l - ir^' 1x1 - L (17) 

Expressing again x i-> Q(x) as a Taylor series around zero, the first two terms on the RHS of 
(15) are evaluated as 
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It is shown in Appendix A.l that Ai(A, a, f3) satisfies 

|Ai(A,a,/3)| < A 2 m (A,a,p) 

where r?i(A, a, /3) = 771 (A, |a|, |/3|) is monotonically increasing in (A, |a|, |/3|) and is bounded for 
every finite A, a, and (3. 

Along the same lines, we evaluate the second integral on the RHS of (13) as 
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It is shown in Appendix A.l that A2(A, a, (3) satisfies 

|A 2 (A,a,/3)| < A 2 r, 2 (A,a,P) 

where 7y 2 (A,a,/3) = ?7 2 (A, |a|, |/?|) is monotonically increasing in (A, \a\, |/3|) and is bounded for 
every finite A, a, and /?. 

Combining (13), (15), (18), and (19) yields 

/ 4> , K (x,y)dydx = - + TT arcsin(g) + — — +A(A,a,/3) (21) 

where 

A(A, a, /3) = Ai(A, a, /3) + A 2 (A, a, /3). 
By the Triangle Inequality [8, Sec. 2.4], we have 

\A(A,a,p)\ < |A!(A, a ,/3)| + |A 2 (A,a,/3)| 

< A 2 (r, 1 (A,a,/3)+7 ?2 (A,a,/3)). (22) 

Proposition 2 follows now from (21) and (22) by noting that if r?i(A, a, f3) and rj 2 (A, a, (3) are 
both monotonically increasing in (A, \a\, |/3|), then so is 

r]{A, a, 0) 4 ^ (A, a, 0) + »fc(A, a, 

and if ??i(A, a, /3) and 772 (A, a, /3) are bounded, then so is rj{A, a, j5). 

4.1.2 Trivariate Case 

In order to prove Proposition 3, we express the CCDF as 
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where the last step follows from the expression for the orthant probability of a trivariate, zero- 
mean Gaussian vector of covariance matrix [14] (see also [11]) 
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If A = 0, then the integrals on the RHS of (23) are zero and Proposition 3 follows directly from 
(23). To prove Proposition 3 for A > 0, we continue by evaluating the integrals on the RHS of 
(23) separately. To evaluate the first integral, we express the joint PDF (x, y, z) n- 0o,k(x, y, z) 
as the product of a marginal PDF f(x) and a conditional PDF f(y, z\x) 

4> ^{x,y,z) = <t>oA x )<t>n(x),K( x )(y,z), (x € M, y e M, z e M) 

where (y,z) M> m ,k(-)(2/> z) denotes the PDF of a bivariate Gaussian vector of mean fi and 
covariance matrix K, and where 
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where A(A, a, /3) and S(-) are as in the previous section, and where 
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Here the second step follows by substituting 
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and the third step follows from Proposition 2. 
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By applying (18), we obtain for the first term on the RHS of (24) 
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It is shown in Appendix A. 2 that Ai(A, a, /3,7) satishcs 

lA^A,^/?,!)! < A 2 i)i(A,a,/5,7) 

where ?7i(A, a, (3, 7) = ?7i(A, |a|, | — y | ) is monotonically increasing in (A, \a\, |7|) and is 
bounded for every finite A, a, (3, and 7. 

To evaluate the second integral on the RHS of (23), we express the joint PDF (x,y,z) i->- 
<h,K(x,y,z) as 
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+ A 2 (A,a,/3, 7 ) 

£13~ g!2g23 



+ A 2 (A,a,/3, 7 ) 

(27) 



gl3~ g!2g23 ^ 



V(i-e! 2 )(i-^3) 
A 2 (A, a, /3, 7) = -5(/8A) - + — arcsm 



4 2tt 



+ £a^ i(2/) 2^(t! 

+ f m v )a(a,-^L^. . 

■/-/3A \ V 1 - 012 V 1 - 023 



013 - 012 023 I 

v / (i-0i 2 )(i-0i 3 )J 

Q12V 7A + g 23 y 

012 V 71 - 0^3 

012J//A 7 + g232//A\ 



J dy 
dy. 



(28) 
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Here the second step follows by substituting 



, x - g 12 y , , z - Q23V 

x = — , and z = 



V 1 - 023 



the third step follows from Proposition 2; and the last step follows from (18). It is shown in 
Appendix A. 2 that A 2 (A, a,(3,-f) satisfies 

|A 2 (A,a,/3, 7 )| < A 2 r/ 2 (A, a ,/3,7) 

where r/ 2 (A, a, (3, 7) = r]2(A, \a\, \(3\, \^\) is monotonically increasing in (A, \a\, \/3\, \-f\) and is 
bounded for every finite A, a, (3, and 7. 

To evaluate the third integral on the RHS of (23), we express the joint PDF (x, y, z) i->- 
(I>o,k(x, y,z) as 



where 



H(z) 



0o,k(z, y, z) = (f>nAz)(t>^(z)Mz){ x ^ V), (x e M, y e M, z e R) 

1 - 013 012 ~ 013023 



013^ 
023 Z 



and K(z) 



012 — 013023 



We have 

1*00 />oo r-0 



/>oo />oo />U 

/ / / (/) .K(x,y,z)dzdydx 
Jo Jo J- 7 A 

/0 />oo />oo 

004 (2) / / (/) fl ( z ).K(z)(x,y)dydxdz 
- 7 A Jo Jo 

/0 roo poo 

(f)a.i(z) / / (j> K , {z) {x',y')dy' dx' dz 



7 A 



00,1 (*0 



1 1 . / 012 - 013023 

- + — arcsin I — 

- 2tt 



+ / 7A ^ i(z) 27^(tSi 



V / (l-0? 3 )(l-0l3) 
023 Z 



dz 



013 v /l "0i 3 , 



dz 



+ 0o,i (z) A A, '-7S=r 

^-7A V Vi-013 V 1 - 023/ 



dz 



7 A 



0o,i(z)dz + c5(7A) 



1 1 



A 



7 , J_ 
4 2tt 



■ arcsin 



4 2tt 

012 — 013023 



arcsm 



012 - Ql3t 



where 



and 



K'(z) = 



1 



va- 

+ A 3 (A,a,/3,7) 

g!2 — g!3g23 



+ A 3 (A,a,/3, 7 ) 

(29) 



V / ( 1 -e? 3 )(i-e| 3 ) 

7 1 • ; ' 1 .7..:;; ^ 



A 3 (A,a,^ 7 ) = -%A) 



1 1 

4 + 2^ 



+ / 0o,i(*) 



arcsm 



012 — 013023 

V(i-e?3)(i-ei 3 ), 

013 ^ 023-2 



7 A 2V27T \ v/T" 013 V 1 ^ 023, 



dz 



+ 



/ 0o,i(*)A 

J- 7 A 



A,^,-^ dz. 
v V 1 -0l3 V 1 -0i3/ 



(30) 
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Here the second step follows by substituting 

, _ X - Qi 3 Z 



, , y - Q23Z 

and y = : 

V 1 - ^23 



the third step follows from Proposition 2; and the last step follows from (18). It is shown in 
Appendix A. 2 that A3 (A, a, (3, 7) satisfies 

|A 3 (A,a,/3, 7 )| < A 2 r7 3 (A, a ,/3, 7 ) 

where 773 (A, a, (3, 7) = 773 (A, |a|, \(3\, | 7 |) is monotonically increasing in (A, |a|, \(3\, \-f\) and is 
bounded for every finite A, a, (3, and 7. 
Combining (23)-(29) yields 



00 poo poo 



^o.k^j y, z ) dz dy dx 

aA J —f3A J — 7A 

- + — (arcsin(pi 2 ) + arcsin(pi 3 ) + arcsin(p 23 )) 



+ 



a + 8 + 7 a 

H ■ 

4 2tt 



■ arcsm 



023 ~ 012013 

V(i-rf 2 )(i-eO 



2tt 



arcsm 



013 — 012 023 



arcsm 



012 — 013023 



V(i-^ 2 )(i-^ 3 )y 2 ^ wa^a-^) 



+ A(A,a,/3,7) 



(31) 



where 



A(A, a, /3, 7) = Ai(A, a, (3, 7) + A 2 (A, a, (3, 7) + A 3 (A, a, /3, 7). 
By the Triangle Inequality, we have 

|A(A,a,/3, 7 )| < |Ai(A,a,^,7)| + |A 2 (A, a, /3, 7 )| + |A 3 (A, a, /3, 7 )| 
< A 2 (771 (A, a, /3, 7) + 772 (A, a, /3, 7) +773 (A, a, /3, 7)). 



(32) 



Proposition 3 follows now from (31) and (32) by noting that if 771 (A, a, 8, 7), 772 (A, a, /3, 7), and 
773(A) a, /3, 7) are monotonically increasing in (A, \a\, \8\, \-y\), then so is 

7?(A, a, /3, 7) = 771 (A, a, /3, 7) + 772 (A, a, /3, 7) + 773 (A, a, /3, 7); 

and if 771 (A, a, /3, 7), 77 2 (A, a, /3, 7), and ?73(A, a, j3, 7) are bounded, then so is 77(A, a, j3, 7). 

4.2 Proof of Theorem 1 

To prove Theorem 1, we derive a lower bound on C_i_(P) and compute its ratio to P in the 

limit as P tends to zero. To this end, we evaluate (2W)/n/(l{ i ; Y?) for {A fe , fc e Z} being a 
sequence of IID, binary random variables with 



Xu = 



\/~P, with probability \ 

—VP, with probability i. 

We shall restrict ourselves to waveforms g(-) that satisfy 

'I- 1/2 N 



E 

^0 



2W 



E 



E 



2W 

1+1/2 
2W 



< 00 



< 00 



< 00. 



(33) 
(34) 
(35) 
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By the chain rule for mutual information [9, Thm. 2.5.2] 

k—1 

> —l(X k ;Y k ) 

= 2 ^Y,l{X k ;Y k _ h ,Y k ,Y k+h ) 

k=l 



where we dchne 



= 2WI(X I ;Y i ,Y,,Y.) (36) 



Here the second step follows because reducing observations cannot increase mutual information 
and because {X k , k e Z} is IID; the third step follows from the definition of Y^; and the last 
step follows because the joint law of (X k , Y k _ 1 , Y k , Y k+ i) docs not depend on k. 

4.2.1 The Joint Law of (Xi, Yi,Yi, Yj) 

In order to evaluate l{X\\ Yi , Y\, Y|) , we shall compute the conditional probability of 
(Yi,Yi,Y|), conditioned on X\. To this end, we hrst compute the conditional probability 
of (Yi,Yi,Y|), conditioned on X™^, and then average over (X^^, X%°). 

To compute Pr(Yi = 1, Yi = 1, Y| = 1 | X 00 ^ = x™^), we hrst note that by (4) 

Y r = l ^ ^ X/fl ^Z_j+Z T >0, rel (37) 



where we dchne 
Let 



^ = (Z*LPF W )(^), r 



( CO N A 1 X- A/ 2 -^ 



and 



V /P(2W)(WN ) £ ^ X " 5 V 2W 
P{X ^ ] ~ VP(2W)(WN ) J^ 5 (W 



/ oo \ a 1 ^ ^ 3/2-^ 

7 ^°° j - V /P(2W)(WN ) A*'H 2W J 



V /P(2W)(WN ) /= 
It follows from (37) that 
Pr(n = 1, Y 1 = 1, Ys = 1 | = 



= Pr(Zi > -a(x^ oo ) y /?(WN ),Z 1 > -^(x^) y/P (WN ),Zs > -7(^-00) VP(WN )) 
= Pr I ; 1 Zi > -aix^Vf, - 1 Zi > -^(x 00 ^)^, - 1 Z3 > ^(x 00 ^) VP ) 

/OO /'CO /'CO 

_ / <j> 0iK (x,y,z)dzdydx 



1 1 • / N 

= g + ^ arcsm(p) + 



(x- 0O )+/3(x- co )+ 7 (x- co ) 



H — arcsin — arcsm 



2ir ~~"'~y^/T^2j 2tt \l~P 2 

+ A (VP, a{x^), 0(x^), j{x^)) (38) 
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where 

1 p 
pip 
p 1 

Here the third step follows by noting that {Z T , r e R} is a zero- mean Gaussian process of 
autocovariance function r i->- WN sinc(r); and the last step follows from Proposition 3. 
Since {Xk 1 k e Z} is IID and of zero mean, we have 



, ' (2W) ( WNo) " V 4W7 '" V /P(2W)(WN ) ^ E ^ ^ 



1/2-1 
2W 



V /(2W)(WN ) V 4W 



and 



/?o = E 


>(*~oo) 


Xi = VP 


7o = E 


7(*~oo) 


x x = Vp 



1 



V(2W)(WN ) 
1 



3(0) 



V /(2W)(WN ) V 4W 
By setting X\ = VP and averaging over (X®^, X%°), we thus obtain 

Pr(Yi = l,Yi = 1,Y| = 1 I Xi = VP) 

"a(X- co )+/?(X- co )+ 7 (X- co ) 



(39) 

(40) 
(41) 



1 1 

8 + 27 



arcsin( i o) + E 



, <*(*-oo) +7(^oo) . 

H — arcsm 

27T 
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2tt 



arcsm 



1-ff 



x 1 = Vp 



A (V?,a(X^ OD ),(3(X^ OD ), 1 (X^ OD )) \x 1 = VP 



11 . , , 
- + — arcsm( ( o) + \ — 



8 2tt 



2tt 



Qq + /3q + 7o 
4 

. "o+7o 



■ arcsm — , arcsm 

2tt I J 2n 



1-p* 



+ E 



We next show that 



A (y/?M x -°o),P( x ?oo)M x -ooj) J Xr = VP] = o(VP) 



(42) 



(43) 



where x h-» o(x) satisfies lim^o o{x)/x = 0. To this end, we use the Triangle Inequality to upper 
bound 



H X -oo) < 



1 



- Y \xA 

VP(2W)(WNo),£- co l 



1/2- I 
2W 



V(2W)(WN ) 



E 

^=-00 



/l/2_/£ 
I 2W 



"n 
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which, by (33), is finite. Here the second step follows because, for our choice of {X k , fceZ}, 
we have with probability one |Xfe| = V?. Similarly, we have 



|0(*-oo)|< 



1 



V(2W)(WN ) 

A B 

— Pmax 
< OO 



E 



1-1 

~2W 



and 



|7(*~oo)|< 



1 



V /(2W)(WN ) ,f 

— 7max 
< OO. 



E 



3/2 
2W 



We thus obtain 



< EfjA^a^),/?^)^^))! I X 1 = V? 

< P (y?, a max , /3 max , 7 max ) 
= o(VP) 



(44) 



where the first step follows from the Triangle Inequality; the second step follows because 
A(A, a, /3,7) < A 2 r](A, a, (3, 7); the third step follows because r;(A, a, 8, 7) = r?(A, |a|, | — ^ j ) 
is monotonically increasing in (A, |a|, |/3|, |-y | ) ; and the last step follows because r;(A, a, (3, 7) is 
bounded for finite A, a, 8, and 7. This proves (43). 
Combining (42) and (43) yields 

Pr(Yi = 1, Fi = 1, Y| = 1 I Xi = V?) 
1 1 



8 2tt 



arcsin(p) 

ao + A+7o , "o + 7o 

- H arcsm 



4 2tt yv^p 2 

By averaging (38) over X^^, and by noting that 



- — arcsm . 

Z7T \ 1 — p 2 



and 



Efa^J] = E^X-J] = £[7^)] = 
A^a^-J,/?^)^^))] =o(VP) 



+ o(VP). (45) 



(46) 



we obtain for the unconditional probability 



Pr(n = 1, Y x = 1, Ys = 1) = I + ^- arcsin(p) + o(Vp) . 

2 2 o Z7T 



(47) 



It thus follows from Bayes' law that 

Pr(Xi = Vf I Yi = 1,Y! = 1,Y| = 1) 

- + HlLhlo arcsin 



1 

= 2 + 



^ arcsin ( T ^)+ (VP) 



2tt 



a +j3o+7o + ££1±70 ^ _____ 

P 4 2 *- V\/W 



i arcsin(p) + o(\/P) 



^ arcsm | 



2tt 



i + i arcsin(p) 



+ o(VP) (48) 
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where the last step follows because ~ f + °( x ) f° r an y £ 7^ and i9 e R. Note that the 

first two terms on the RHS of (48) depend on . . . , X-i, Xq, X2, X3, . . . only via their means. 
Thus, if {Xk, k e Z} is of zero mean, then intersymbol interference affects only the o(\/P)-term, 
which does not influence our lower bound on the capacity per unit-cost. 

The probability Pr(Yi = 1, Yi = 1, Ya = -1 | X x = VP) can be computed in a similar way. 
We have 



-1 X c _ 



Pr(Yi = 1, Yi = l,Ys 

v 2 2 

/oo />oo /■- 

-a(x°°„)\/P ^-/3(x°°„)\/P i- 



= T°° "I 
X) ^ — ooj 

7(x~oo)V^ 



b 0tK (x,y, z)dz dy dx 



/oo /*oo /'OO 

)\/P J-P(x<?> )V? J-l( 



t>o,G(x,y) dy dx 



1 1 • / N 

= — I arcsin p 

4 2tt 

11 • f \ 

~ 8 ~ 2^ arcsm ^) 



a^oJ+^oo) /P 



Vk(2;,2/, z)dz dy dx 



- + A(VP,a( 2 ;- co ),/3(x!f 0O )) 



cfo-oo)+/fo-°o)+7fe,) 



, "(g-oo)+7(g°., 

H arcsin 

Z7T 



-00 J 



OO \ 

OO / 



2tt 



■ arcsin 



= - + 



«(*~oo) +0(^00)- 7(^-oo) 



2tt 



arcsin 



+ 



0(a 



2tt 



■ arcsin 



+ A (VP, a^), - A (VP, 

where K is the same as in (38), and where 

G = 

Here the second step follows because 

6o,k(z, y, z)dz = (f> ,G(x,y), (x eKje R) 



(49) 



1 P 
P 1 



P = 



f 

j — i 



and the third step follows from Propositions 2 and 3. By setting Xi = \/P and averaging over 
(X^^X?), we obtain 



Pr(Fi = 1, Yi = 1, Ys=-l\X 1 = V?) 



+ Po ~ 7o _ a o + 7 o 
4 2tt 



arcsin 



. A> 

+ — arcsm 

27T 



+ E 

- E 



(50) 



where ao, /?o 3 an d 70 are defined in (39)-(41). It was shown above that the last term on the 
RHS of (50) decays faster than VP (43). Repeating the steps in (44), it can also be shown that 



a (y?M^co),^co)) \x, = V?]= o(Vp). 
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By noting that o(\/P) - o(VP) = o(v / P), we thus obtain 
Pr(Yi = 1, Yi = l,Ys = -1 | Xx = VP) = \ 



+ 



ao + flo - 7o a + 70 



■ arcsm 



00 

2tt ' 



■ arcsm 



4 2tt 

It follows from (46) that the unconditional probability is given by 

Pr(n = i,r 1 = i,r | = -i) = i + (Vp). 

By Bayes' law, we thus have 

Prpfi = VP | Yi = 1, Yi = l,Y f = -1) = \ 
+ A) - 7o + 7o 



+ o(VP). (51) 



(52) 



+ 4i 



■ arcsm — , H arcsm 

2tt I V^7) 2lr 



+ o(\/P). (53) 



The other conditional probabilities can be computed along the same lines. The probabilities 
corresponding to (Yi = 1, Y\ = —l,Ya = l) and (Yi = — l,Yi = 1, Ys = l) are given by 



Pr(Xi = V?\Yi=l,Y 1 = -l,Ys = 1) 

_ Qq — ft) +70 _ »o+7o 



217 



arcsm 



+ arcsin 



2tt 



i l 



i arcsin(p) 



+ o(VP) (54) 



and 



Pr(Xi = VP I Yi = -1, Y = 1, Y § = 1) = \ 



+ 4\ 



-a Q + /3 + 7o + 7o 



2tt 



■ arcsin 



Po_ 
2tt ' 



■ arcsin 



1-p- 



+ o(VP). (55) 



The remaining probabilities can be computed from (48), (53), (54), and (55) by noting that, due 
to the symmetry of </>o,k(-)i 

Pr(Xi = >/P| Yi =yi,Yi=yi,Y| = ys) = Pr(Xi = ->/P | Yi = — j/i , Yi = -j/i, Ys = -ys). 

4.2.2 Evaluating J(X i; Yi,Yi,Y|) 
Let 

= Pr(Xi = VP | Yi = j/i, Y = t/i,Y| = j/a). 
We express the mutual information l{X\; Yi , Yi, Y3) as 



7 X i; Yi,Yi,Ys =H(X 1 )-H(X 1 Yi,Y,Y3 



log2-E 



ff 6 (p(Y | ,Y 1 ,Y | 



(56) 



where Hb(p) = —p\ogp — (1 — p) log(l — p), < p < 1 (with OlogO = 0) denotes the binary 
entropy function. To evaluate the binary entropy function, we express -ff&(-) as a Taylor series 
expansion around |, i.e., 



(57) 
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Substituting £ = p(y 1,2/1,2/3) — 1/2 and averaging over (Yi , Yi, Yi) yields thus 



/(X^Yi^Ys) =log2-E 



log 



2 - 2 (p(Yi , Yi , Ys) - 1) + o M p(Yi , Yi, Ys 



= 2E 



2E 



(p(Yi,Yi,Ys)-i 



+ o(P) 



(58) 



where the last step follows because, by (48), (53), (54), and (55), we have for every (2/1,2/1,2/3) 



p{y±,yi,y%) - 

where x ^ O(x) satisfies lim^olC^)/^! < 00, so 



O(P) 



o ^p(2/i,2/i,2/|) - ^ ^ =o(P) 



which implies that 



o p Yi,Yi,Ys 



o(P) 



since the expectation is given by the sum of eight terms that all decay faster than P. 
By applying (48), (53), (54), and (55) to (58), we obtain 

7(Xi;Yi,Yi,Y3) 

v 2 2 ' 



Qo+/3q+7o 1 "0+70 arr . Q ;r, _ 



- ft arcsin v 1-p2 



i + i arcsin(p) 



4 arcsin 

\ 4 2tt 

Qq— /3q+7o _ »o+7o 



A . ( P 2 

, f — arcsm - 

;± I 2tt \ 1 /'• 



+ 



arcsin . . 



+ ^ arcsm , 



i - i arcsin(p) 



/ -a + A) + 7o + 7o 

+ 4 arcsin 

\ 4 2tt 



P \ , A 

— , + — — arcsin . 

2tt vi-p 2 



o(P)(59) 



where a , A, and 70 are defined in (39)-(41). 

Combining (59) with (36) and (6), and computing the ratio to P in the limit as P tends to 
zero, yields the lower bound on the capacity per unit-cost 



Ci(0)>^ 



ao+fto+70 1 "Q+70 pTpoin 



t arcsin (r^) 



I 1 1 

4 



^ arcsin( i o) 



/ a Q + A - 70 a + 70 
+ 4 i — arcsm 



p jo . / r 

— ; + ^— arcsin ^ 

a/wv 2tt \i-p\ 



17 



+ 



Qq— /3q+7o 
4 



ao+7o 
2tt 



arcsin 



§0 

2-n 



arcsin 



i - i arcsin(p) 



/ -a + Po + 7o + 7o 
+ 4 arcsin 

\ 4 27T 



2tt 



= i?(0). 



(60) 



Note that this lower bound holds for all unit-energy waveforms g(-) that are bandlimitcd to W 
Hz and that satisfy (33)-(35). In the following section we evaluate the RHS of (60) for a specific 
choice of <?(•). 



4.2.3 Choosing a Waveform 

Any choice of g(-) satisfying the above conditions yields a lower bound on C i (0). Here we shall 
find the waveform that gives rise to the largest lower bound. Thus, we wish to maximize the 
RHS of (60) over all unit-energy waveforms g(-) that are bandlimited to WHz and that satisfy 
(33)-(35), namely, 



E 



E 



1/2 



2W 



\2W 



£+1/2 
2W 



< oo 



< oo 



< CO. 



To further facilitate the optimization problem, we choose g(-) to be a symmetric function, which 
implies that a = 70 • (Numerical computation suggests that this choice is indeed optimal.) In 
this case, the rate per unit-cost R(0) is of the form 

aal + 2ba /3 a + c/3g 

(where a > 0, c > and ac — b 2 > 0), so i?(0) is a convex function of (ao,/?o), and all pairs 
(ttO)/?o) that give rise to the same rate per unit-cost lie on an ellipse. Let Q denote the set of 
all waveforms satisfying the above conditions, and let B denote the set of all pairs («0)/3o) that 
arise from these waveforms. Since R(0) is convex in (chojA)), it follows that the supremum of 
i?(0) over B lies on the boundary of B, which we shall determine next. 

While the set of all bandlimited functions that satisfy (33)-(35) is convex, the set of all unit- 
energy functions is not (see [15, Sec. 2] for a definition of a convex set). Indeed, if for example 
the functions gi(-) and ^O) are of unit energy and satisfy g\(t) = — g 2 (i), ieK, then 

t->&i(t) + (i-0s2(*). £e[o,i] 

is of energy (2£ — l) 2 , which is not equal to 1 unless £ — 1 or £ = 0. Nevertheless, the set of all 
functions whose energy is less than or equal to 1 is convex, since 



Il£gi + (i-?)g2|| 2 <(£l|gi|| + (i-£)l|g 2 | 



< 1, 



:|gi|| 2 <l,||g 2 || 2 <Ue[0,l]) 



where ||g|| = \J j (g(t)) 2 dt, and where the inequality follows from the Triangle Inequality. Let 
Q' D Q denote the set of all waveforms that are bandlimited to W Hz, that satisfy (33)-(35), 
and whose energy is less than or equal to 1. Furthermore, let B' D B denote the set of all pairs 
(ao,Po) that arise from these waveforms. Since convexity is preserved under intersection [15, 
Thm. 2.1], it follows that the set Q' (and hence also £>') is convex. We further have that 

sup7?(0) = sup7?(0) = sup7?(0) 

Q Q' B> 
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where the first step follows because every rate per unit-cost R(0) that corresponds to a waveform 
satisfying ||g|| < 1 can be increased by normalizing the waveform; and where the second step 
follows because i?(0) depends on g(-) only via (ao,/?o) and because Q' determines £>'. The above 
optimization problem can thus be expressed as the maximization of a convex function over a 
convex set. 

Let B' denote the closure of B' (i.e., let B' be the smallest closed set containing £>') and let 
A i y ((A) be defined as 

C(A)= sup |Aa + A)|, Ael. (61) 

(ao,/3o)eS' 

Since B' is a closed convex set, it follows that the boundary of B' is given by the points (ao, /?o) € 
B' that achieve the supremum in (61) [15, Sec. 13]. Note that the boundary of B' is symmetric 
with respect to the origin, since (ao,flo) and (— ao,— A)) yield the same value for |Aao + /?o|, 
A e R. 

For every A € I, the supremum in (61) is achieved by the pair (ao,/3o) that corresponds to 
g(-) being of Fourier Transform 

g(f) = = , V ' I {|/| < W} , /Si (62) 



where I{-} denotes the indicator function, i.e., I {statement} is 1 if the statement is true and 
otherwise. This yields 

1 - + -A 

a = -== * 2 Ael (63) 

VWN^^/i A2 + | A + 1 

and 

1 1 + ^A 

0o= = , Age. (64) 



V^ + lA + l^ 



Note that the waveform given by (62) is of unit energy and is bandlimited to W Hz, but it does 
not satisfy (33)-(35). Nevertheless, it is shown in Appendix B that there exist pairs (ao, /?o) € B' 
that are arbitrarily close to (63) and (64). The pairs (a ,f3 ) given by (63) and (64) are thus 
in the closure of £>' and the boundary of B' is hence parameterized by (63) and (64). (Notice 
that (63) and (64) describe only one half of the boundary of B' . The other half is given by 
(-a ,-A))-) 

The above problem is illustrated in Figure 2 for W = No = 1. The outer curve (solid line) is 
the contour line corresponding to R(0) ~ 0.747. The inner curve (dot-dashed line) depicts the 
boundary of B' as parameterized by (63) and (64). The two curves touch at two points: one 
corresponds to A = 1.4 and the other is the same point reflected through the origin. Since R(0) 
is convex in (ao,/?o), it follows that any point that lies inside the depicted contour line (solid 
line) yields a rate per unit-cost that is smaller than 0.747. Thus, the two points on the boundary 
of B' that touch the contour line yield i?(0) « 0.474, whereas the other points (a ,(3 ) € B' yield 
a rate per unit-cost that is smaller. Therefore, we conclude that choosing (ao,/3o) according to 
(63) and (64) with A = 1.4 maximizes R(0). 

In the following, we summarize the above arguments to compute the supremum of i?(0) over 
the set B' for general W and No- To this end, we first recall that R(0) is convex in (ao,/?o) 
and it therefore suffices to maximize i?(0) over all boundary points of B' . We next use that 
every boundary point of B' is given by the parametric equations (63) and (64). Therefore, by 
applying (63) and (64) to (60), the maximization of R(0) over the set B' can be expressed as 
the maximization over the real scalar A 



C i (0) > sup--i- 



+ ^ + -^iarcsin 



\ + \ arcsin(p) 
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PW «(A) 



fi(A) _ _ 5(A) 



arcsm 



p 



, /3(A) 



1-p? 



arcsm , . 



\~\ arcsin(p) 



(65) 



where 



and 



*(A) = 



7T 2 



1 + 4A 



2^ + ^ + 1 



A e 



A e 



Numerical computation shows that the supremum on the RHS of (65) is achieved for A = 1.4, 
which is consistent with our interpretation of Figure 2. The capacity per unit-cost (7 i (0) is 
thus lower bounded 



C(0)> 



31 4- so _i_ m_ 

2 ' 4 ' 7T 



¥ arcsin (t^) 



\ + i arcsin(p) 



4 7T Vv/l^, 



_ a _ Si arcs in 

Z 4 7T 



3o ■ P 
— arcsin - 

2tt V 1 - P 



I 2 - arcsin 



i - i arcsin(p) 



0.747 



(66) 
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where 



9i 



^ 2 + |A- 



This proves Theorem 1. 



and 



90 



A=1.4 



A=1.4 



5 Summary and Conclusion 

We demonstrated that doubling the sampling rate recovers some of the loss in capacity per unit- 
cost incurred on the bandlimited Gaussian channel with a one-bit output quantizer. Indeed, 
when the channel output is sampled at Nyquist rate 2W, it is well-known that the capacity per 
unit-cost is given by w 0.64-^ [3], which is a factor of | smaller than the capacity per 

unit-cost of the same channel but without output quantizer. We showed that, by sampling the 
output at twice the Nyquist rate, a capacity per unit-cost not less than 0.75-jq— can be achieved. 
This can be viewed as a very- noisy counterpart of the work by Gilbert [6] and by Shamai [7], 
which demonstrated that oversampling increases the capacity of the above channel when there 
is no additive noise. 

The conclusions that can be drawn from this result are twofold. Firstly, we demonstrated 
that in order to reduce the loss in capacity per unit-cost caused by the quantization, one can 
cither increase the quantization resolution or the sampling rate. Thus, it is possible to trade 
amplitude resolution (quantization) versus time resolution (sampling rate) . Secondly, we observe 
that while sampling the output at Nyquist rate is optimal for the AWGN channel (without output 
quantization), this does not hold when the output is quantized. Thus, a communication scheme 
that is optimal in the sense that it achieves the capacity need not be optimal anymore if the 
channel output is processed by a noninvertible operation (such as quantization). 
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A Appendix to Section 4.1 

A.l Bivariate Case 

We show that, for A > 0, a € R, and f3 <G R, we have 

|Ai(A,a,/3)| < A 2 m (A,a,P) and | A 2 (A, a, p)\ < A 2 7? 2 (A, a, /?) 

where Ai(A, a, (3) and A 2 (A,a,/3) are defined in (16) and (20), and where r?i(A, a, f3) — 
?7i(A, \a\, \0\) and 7/ 2 (A,a,/3) = r]2(A, \a\, \(3\) are monotonically increasing in (A, |a|,|/3|) and 
are bounded for every finite A, a, and (3. We have 

|Ai(A,a,/3)| 
\0\A 




< 



V2tt(1 - Q 2 ) 

HI/31 
n/2tt(1 - (?) 
A 2 m (A,a,p) 



A 2 + 



V2tt(1 - g 2 ) Jo 

v 2 



\q\ot 



y/Ml ~ Q 2 ) 



=A 2 



H\P\ + \p\H) A , 

6^(1 -£» 2 ) 5 



12V27T 



A 



(67) 
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where 



r?i(A, a, /3) = 



M« 2 



^2tt(1 - g 2 ) ^/2tt(1 - g 2 ) 6v^tt(1 - £ 2 )^ 



al) , 
17 A 2 



12V2tt 



A. 



Here the first step follows from the Triangle Inequality; the second step follows by upper bound- 
ing (f>o,i(x) < 1, x £ R and from (17); and the third step follows because, over the range of 
integration, x < \a\A and \/3A + gx\ < + \g\ \a\)A. 
Along the same lines, we obtain 



|A 2 (A,a,/3)| 
\q\ 



< 



< 



VMi - Q 2 ) 
\q\ 

VMi - g 2 ) 

A 2 m (A,a,(3) 



r° 

/ <t>o,i(y)\y\dy 

J-fiA 

/ ydy + 
Jo Jo 



/3A 



00,1 (y) 



QV 



(\e\yY 



lfl|3 



dy 



+ ^\S(PA)\ 



where 



r) 2 (A,a,/3) 



\e\P 2 



6\/27(l - g 2 ) : 



■dy + 



12V2tt 



A 1 



This proves the claim. 



y^l - g 2 ) 6\/2^(l - g 2 )l 12>/fcr 



(68) 



A. 2 Trivariate Case 

We show that, for A > 0, a e M, j3 e M, and 7 e K, we have 

|Ai(A,a,/3, 7 )| < AV(A,a,/3,7) 
|A 2 (A,a,/3, 7 )| < A 2 r; 2 (A, a ,/3,7) 

and 

|A 3 (A,a,/3, 7 )| < A 2 r7 3 (A, a ,/3,7) 

where Ai(A, a, (3, 7), A 2 (A, a, /3, 7), and A3 (A, a, /3, 7) are defined in (25), (28), and (30), and 
where 771 (A, a, ft, 7) = 771 (A, |a|, |/3|, I7I), rj 2 (A, a, /3, 7) = 772 (A, |a|, |/3|, | 7 |), and 773 (A, a, (3, 7) = 
773(A, \a\, \/3\, I7I) are monotonically increasing in (A, \a\, |/3|, I7I) and are bounded for every finite 
A, a, (3, and 7. We have 



1 1 

- + — arcsin 

4 27T 



lAx^a,/?^)! < |5(aA)| 

,0 , 



+ 



1 



a A 


aA 



4>os(x) 



2tt 
A (A, 



£> 2 3 — £l2f?13 
vV /(l-^ 2 )(l-&) 

/3A + gi 2 a; 7A + pi 3 :r 



da; 



P + Q12X/A 7 + g 13 x/A \ 



dx 



S > A)l + 2i 



1 



Q A 2V27T 



/3A + gi 2 x 7A + g 13 x 



+ 



< 



12V27T 



aA 



A 



( P + gi 2 x/A 7 + g 13 a;/A ^ 



V 1 - QI2 \A - els 

dx 



dx 



1 



2V27T 



|a|A 



+ A' 



,|a|A / 



|/3|A+|gi 2 |a; | 7 |A+|g 13 |a; 
+ |ei2|a;/A |7| + |£>i 3 |ar/A N 



dx 



dx 
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a" q a 
< 1 _ A 3 + 



12 v / 2^ : 2\/2^ y y/l - Q \ 2 

( \R\ A- I rii ol Irvl l-vl 

+ aij A 



Iffl2| |a| , ItI + 1 013 1 H \ A 2 



+ I £«i2 1 \a\ 111 + 1 013 1 M \ a3 



012 



013 



A 3 



AV(A,a,/3,7) 



(69) 



where ry(A, a, /3) is as in Proposition 2, and where 

U|3 \a\ / 1/3| + |ei 2 | |a| , | 7 | + \qi 3 \ M 



7/i (A, a, P, 7) = 



=A + - 



12-S/27T 2\/2T y ^1 - V^-013 
-|0i2||a| 111 + 1 013 1 1 a | 



+ mv a,- 



012 ' V 1 - 013 



A. 



Here the first step follows from the Triangle Inequality; the second step follows by upper bounding 
0o,i ( x ) < 1, x € K and arcsin(a;) < 7r/2, |ie| < 1; the third step follows again from the Triangle 
Inequality, from the upper bound 

\A(A,a,p)\ < A 2 V (A,a,P) 

and from the monotonicity of rj(A, a, j3) in (A, |a|, |/3|); and the fourth step follows because, over 
the range of integration, x < \a\A. 

Along the same lines, we obtain for A 2 (A, a, /3, 7) 



|A 2 (A, a ,/3, 7 )| < \8(pA)\ 

+ 



1 1 / 013 - 012023 

- + — arcsin — : 

4 ^ W(l- 0^(1-013), 



0122/ 7 A + 023^ 



V 71 - 012 V 71 - 02-3 



dy 



/ 00,1 (2/) 

J-f)A 



< 



A f A gl2?/ / A 7 + 023^/A \ 

|012||/3| + | 7 | + |023||/9| \ 



12>/fcF 2V27 Vv/i-012 v 71 - 023 / 

+ I,I,( A , W^,MJ±sM A , 

V V 1 -0l2 V 1 - 023 / 

A 2 77 2 (A,a,/3, 7 ) 



(70) 



where 



ry 2 (A,a,/3, 7 ) = 



=A 



I 012 I 



H + MlM 

I2V2T' ' 2V27 ^ \/l - 0? 2 V 1 - 023 / 

V V 1 -0i2 V 1 - 023 / 



Finally, we obtain for A3 (A, a, /3, 7) 
|A 3 (A, a ,/3, 7 )| < |<5( 7 A)| 



1 1 . / 012 - 013023 

— I arcsin — : 

4 ^ \ x/(l-01 3 )(l-0l3), 



1 



7 a ' ^ ^2\/27T 



013^ 023^ 



/ 00,1(2) 



A A, 



V 1 ~ 013 0-023 
013-z/A g 2 3^/A 

\/i - 013' \/i - 023 y 



dz 



d2 
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where 



" 12V2^ 2^ \y 1 - ^ 3 VI -023/ 

= A 2 % (A,a,/3, 7 ) (71) 



12^ 2V2^\ V VT _ yi3^2-^ 



This proves the claim. 



B Appendix to Section 4.2 



We show that there exist bandlimited, unit-energy waveforms g(-) that satisfy (33)-(35) and 
that give rise to pairs (ao,Aj) that are arbitrarily close to 



1 



rr 2 



VWn a /±A 2 i 



A + l 



and 



1 + A| 



A e 



A g 



(72) 



(73) 



VWN^i A2 + i A + 1 ' 

To this end, we first note that a sufficient condition for the waveform g(-) to satisfy (33)-(35) is 

K 2 



\g(t) <Ki, (6 



and 



t 2 ' 



> T 



(74) 



for some nonnegative Ki, K 2 , and some positive T. Indeed, if (74) holds, then we have for every 



E 



~2W 



< 



E 

£=-oo 



£ + T 

2W 



2W 



- E 

|^w|<T+^v 

< 2Ki (2WT+1) + 

H 

< 2Ki (2WT+l) + 2 



E 



E 

|w~| >T +2W 

K 2 (2W) 1 

2W 

K 2 (2W) 2 



2W 



/ 2 wt+i - iy 

2W 

2Ki (2WT+1) + 2 Ka- 



di 



< oo 



(75) 



where the first step follows because \g(r/(2W)) | is nonnegative; the third step follows from (74) 
and because there are not more than 2WT+ 1 terms satisfying \(£ + r)/(2W)| < T + 1/(2W); 
and the fourth step follows by upper bounding the sum by an integral. 



24 



The waveform g(-) corresponding to the Fourier Transform (62) is given by 

g(t) - 



2W 



TvJ : ix^y ( sinc ( 2W *) + \ sinc ( 2Wt ~ I) + ^ sinc ( 2W/ 



tel. (76) 



We note that the sum of the last two terms on the RHS of (76) equals 

A 



- sinc ( 2Wt - - ) + - sinc ( 2Wt + - 
2 V 2/2 V 2 



2' 

A cos(27rWt) 
I _ 2,((2Wt)^i) 



t - 



which satisfies (74) and hence also (33)-(35). However, 1 1-> sinc(2Wi) decays like 1/t and does 
neither satisfy (33) nor (35). We therefore replace t \-t sinc(2Wi) in (76) by a raised-cosine 
pulse of bandwidth W Hz and of roll-off factor £ to obtain the desired waveform 



9&) 



2W 



1_ / 2WA cos (n2Wj^t 
^\ 1 + Z SmC ^ + z)l-4(2W^t) 2 



- sinc [2Wt - - 
2 V 2 



- sinc [2Wt + - 
2 V 2, 



(77) 



where 



(0 = 2\vJ 



( 2Wt\ cos ( 7r2W I+?' 
— sinc 



+ - sine [ 2Wt - - | + - sinc [ 2Wt + - 
2 V 27 2 V 2, 



dt. 



For every £ > 0, the raised-cosine pulse on the RHS of (77) satisfies (74) and hence also (33)- 
(35). It therefore follows from the Triangle Inequality that also the pulse g^(-) satisfies (33)-(35), 
since for every te R 



!*(*)!< 



' 2W 



1 



7r2W T | | t 



+ 



1 2W 



( 2Wt\ cos ( 
— - sine 

+^ vi + e; i _ 4 ( 2W _i_ 



^ sinc (2Wi - i j + ^ sinc | 2\V I 



Furthermore, since g^(-) is of unit energy and bandlimitcd to W Hz, it follows that <?{(•) is in 
Q' , and hence the pairs (ao,flo) that arise from t^(-) arc in B' . 

It remains to show that such (ao, j3o) approach (72) and (73) as £ tends to zero. To this end, 
we show that for every fet 



(78) 



This implies 



and 



lim an = lim 



ao v /(2W)(w^N )' 95 V 4VV 



lim /3 = lim — ^=Qf (0) 

ao K ao ,/(2W)(WNo) 



VWN^i A2 + | A + 1 
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which in turn proves the claim. 



To prove (78), we note that 1 1-> ^^f(^)g^(t) — which we shall denote by 1 1-> h^(t) — satisfies 

(79) 



lim he it) 



I A 2 + ^A + !),/(/). 



f e 



where g(-) is the Inverse Fourier Transform of (62). Furthermore, since the Fourier Transform of 
h 6 {-) satisfies |^(/)| 2 < (l/2 A 2 + 4/ir A + l)\g(f)\ 2 , / el (with g{-) given by (62)), and since 
/(l/2 A 2 + 4/7T A + l) \g(f)\ 2 df < oo, it follows from the Dominated Convergence Theorem [16, 
Thm. 1.34] that 



df 



lim 



k(f) 



d/ = iA 2 + -A + l (80) 

Z 7T 



where the second step follows from Parseval's Theorem [8, Thm. 6.2.9], and where the last step 
follows from (79) and because g(-) is of unit energy. It thus follows that for every teR 



iim*(t) = iim -ML = l ^mM = m 



(81) 



where the last step follows from (79) and (80). This proves (78) and hence also the claim. 
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